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'.Title: ' An' in vitro rrethcd for construction of a DNA library 



5 FIELD OF THE INVENTION 

The present invention relates to optimizing DNA sequences 
in order to (a) improve the properties of a protein of interest 
by artificial generation of genetic diversity of a gene encoding 
the protein of interest by the use of the so-called gene- or DNA 

10 shuffling technique to create a large library of "genes", ex- 
pressing said library of genes ■ in a suitable expression system 
and screening the expressed • proteins in respect of specific char- 
acteristics to determine such proteins exhibiting desired proper- 
ties or (b) improve the properties of regulatory elements such as 

15 promoters or terminators by generation of a library of these ele- 
ments, transforming suitable hosts therewith in operable conjunc- 
tion with a structural gene, expressing said structural gene and 
screening for desirable properties in the regulatory element. 

20 

BACKGROUND OF THE INVENTION • 

It is general!:/ found that a protein performing ' a certain 
bioactiyity exhibits a certain variation between genera .and even 
between members of the same- species difference's" may'"exist. This 
25 variation is of course even more outspoken at the genomic level. 

This natural genetic diversity among genes coding for pro- 
teins having, basically the sane bioactivity has "been generated in 
Nature over billions of years and reflects a natural optimization 
of the proteins coded for m respect of the- environment or the 
30 organism in question'. ■ ■ * ' 



A 
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In today's society the conditions of life are vastly re- 
: moved from the natural environment and it has been found that the 
' naturally occurring bioactive molecules are not optimized for the 
1 'various uses to which they are put by mankind, especially when 
5 they are used for industrial purposes. 

It has therefore been of interest to industry to identify 
such bioactive proteins" that exhibit optimal properties in re- 
spect of the use to which it is intended. 

This' has for many years been , done by screening of natural 
10 ''sources, or by use of mutagenesis. For instance, within the tech- 
nical field of enzymes for 'use in e.g. detergents, the washing 
and/or dishwashing performance . of e.g. naturally occurring pro- 
teases, lipases, amylases and cellulases have been improved sic-, 
nificantly, by 'in vitro modifications of the enzymes. 
15 ■ in most cases these improvements have been obtained by 

site-directed mutagenesis resulting in substitution, deletion or 
. insertion of specific amino acid residues which have been chosen 
' either on the basis of their type or on the basis of their loca- 
■ ..' tion in the secondary or tertiary structure of the mature enzpe 
20 (see. for instance U.S. patent no. 4,518,5-84) . 

, In this manner the preparation of novel polypeptide vari- 
ants ' and .* mutant s , sucn as novel modified enzymes with altered 
characteristics, e.g. specific ' activity, -substrate specificity, 

- thermal, pH and salt stability ,\ pH-optimuni, pi, K m , V max etc *' 
25 ' has" successfully been performed to obtain polypeptides with Lm- 

- ; : proved properties. , . 

For instance, within the technical : field of enzymes the 
washing and/or dishwashing performance of e.g. proteases, 
pases, amylases and cellulases have been improved significantly 



1 1 — 
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An alternative general approach for modifying proteins and 
enzymes has been based on random mutagenesis, for instance, as 
disclosed in US 4,894,331 and WO 53/01285 

As it is a cumbersome and time consuming process to obtain 
5 polypeptide variants or mutants with improved functional prop- 
erties a few alternative' methods for rapid preparation of modi- 
fied polypeptides have been suggested. 

Weber et al., (1983), Nucleic Acids Research, vol. 11, 
5661, describes a method for modifying genes by in vivo recom- 
10 bination between two . homologous genes. A linear DNA sequence com- 
prising, a plasmid vector flanked by a DNA sequence encoding al- 
pha-1 human interferon in the 5' -end and a DNA sequence encoding 
alpha-2 human interferon in' the 3'-end is constructed and trans- 
fected into a fee A positive strain of £. coli. Recombinants were 
15 identified and isolated using a resistance marker. 

Pompon et al., (1989), Gene 83, p. . 15-24, describes - a 
method for shuffling gene domains of mammalian cytochrome P-450 
.by in vivo recombination of partially homologous sequences in 
Saccharo~yces cerevisiae by- . transforming Saccharonyces cerevisiae 
20 v; i z h a linearized plastic with fiiied-in ends, and a DNA fragment 
being partially homologous to the ends of said plasmid. . 

In WO 97/07205 a method is described whereby polypeptide 
variants are prepared by shuffling different nucleotide sequences 
of homologous DNA sequences by in vivo recombination using plas- 
25 micic DNA as template. 

US patent no. 5,093,257 (Assignee: Genencor Int. Inc.) dis- 
closes ' a method for producing hybrid polypeptides by in vivo re- 
combination. Hybrid DNA sequences are produced by forming a cir- 
cular vector' comprising a replication sequence, a first DNA se- 
30 cuence encoding the amino- terminal portion of the hybrid poly- 
peptide, a second DNA sequence encoding the carboxy- terminal por- 
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tion of said hybrid polypeptide . The- circular vector is trans- 
formed into a rec positive microorganism in which the circular 
vector is amplified. This results -in recombination of said circu- 
lar - vector mediated by the naturally ' 'occurring reccnbinaticr. 
5 mechanism of the rec positive microorganism, which , include prc- 
karyotes such as Bacillus^ and E. coli, and. eu'karyotes such as 
Saccharomyces cerevisiae. 

One method for the/shuffling of homologous DNA sequences 
has been described by Stemmer (Stemmer, -(1994)/ Proc. Natl.. Aca:. i; 

10 " Sci. USA, Vol. 91; 10747-10751; Stenmer, (1994)-, Nature, vol. 
370, 389- 391) . The method concerns shuffling homologous DN'A se- 
quences by using in vitro PCR* techniques. Positive recombinant 
genes containing shuffled DMA sequences are selected from a -D!CA. 
library based oh the improved . function of the expressed proteins. 

15 The above method is also described in WO 95/22 625 . K0 

95/22625 relates to a method for shuffling of homologous DNA se- 
quences. An important step' in the method described in WO 95/22625 
is to cleave the homologous template double-stranded .polynucleo- 
tide into random fragments of a desired size followed by homolo- 

20 cousiy reassembling of the fragments into full-length genes. 

A disadvantage inherent to the method_ of WO 95/22625 is, 
however, that the diversity generated through that method is lim- 
ited, due to the use of homologous gene sequences (as defined m 
WO 95/22625) . ' • 

25 Another disadvantage in the method of WO 95/22625 lies in 

the production of the random fragments by the cleavage of the 
template double-stranded polynucleotide.' 

A further reference of interest is WO 95/17413 describing a 
method of gene or DNA shuffling by recombination of specific CNA 

30 sequences - so-called design elements ( DEI) - either by recombina- 
tion of synthesized double-stranded fragments or recombination cr 
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PGR generated sequences', to .produce so-called functional' elements • 
(FE) comprising at -least two of the design elements. According 
to the method described in WO 95/17413 the recombination has to 
be oerformed among design elements that have DMA sequences with 
sufficient sequence homology to enable hybridization of the dif- 
ferent sequences to be recombined. 

WO 95/17413 therefore also entails the disadvantage that 
the diversity generated is relatively limited. Furthermore the 
method described is time consuming, expensive, and not suited for 
automatization . 

Despite the existence of the above methods there is still a 
need for better iterative in -vitro recombination methods for pre- 
paring novel polypeptide variants.. Such methods should also be 
capable of being performed in small volumes, and amenable .co 

automatisation . . 

SUMMARY Of THE INVENTION 

The present invention concerns ' brief ly ' the utilization; of 
t^oiate shift of a newly synthesized DMA strand curing in vitro 
DMA svnthesis in order to achieve DMA . shuffling . By using this 
unique it is possible to obtain such results in a more expedi- 

^ c— cv-^nr ev-n a create: variation than in 
■ .ent manner, anc to so^e .-.-.^en. ^ - — 1 c 

the above mentioned methods.. . . .. . . . . _ 

The method of the invention is also- very well suited for 
. adaotion to automatisation . . 

In a preferred errioci~.er.t the technique is used in combina- 
tion with an error-prone polymerase thereby introducing an. even 
Greater variation in the library created. 

More soecificaliy the present invention relates to a method 
for- the construction or a library of recombined homologous 
oolvnucleotides from a nur-r of different _ starting single or 
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double stranded parental' DNA templates and primers by induce 
template shifts during an in vitro polynucleotide synthesis usir. 
a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing parental double stranded DNA templates t- 
produce single stranded templates, 

b) annealing said primers ' to the single stranded DNA tem- 
plates, 

c) extending said primers by initiating synthesis by use zz 
said polymerase, 

d) cause arrest of the synthesis, and 

■ e) denaturing the double, strand to separate the extended 
primers from the templates, 

B. a template' shift is induced by 

a) , isolating the newly synthesized single stranded extend-: 
primers from the templates and . repeating ■ steps A.b) -to 
A.e) using said extended primers produced in (A) as both 
primefs and . templates , or ■ 

b) repeating steps A.b)' to A.e), ' 

C. the above process is terminated ' after an appropriate' numr-: 
of cycles of process steps A. and B.a), A. and B.b), or cc=> 
binations thereof, and 

D. optionally the produced polynucleotides are amplified in 
standard PCS reaction .with specific primers to selective! 
amplify homologous polynucleotides of interest. 

In specific -embodiments various modifications can 'be mad 
in. the process of the invention. For example it is advantages 
to apply a defective polymerase either an error-prone polymeras 
to- -introduce mutations ir. comparison to the templates, or a =c 
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lymerase that will discontinue the polynucleotide synthesis pre- 
maturely to effect the arrest of the reaction. 

Further modifications will be described below. 

In a further aspect the invention relates to a method of 
identifying polypeptides exhibiting improved properties .in com- 
parison to naturally occurring polypeptides of the same bioactiv- 
ity, whereby a library of recombined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then transformed into a suitable host system, 
to be expressed into the corresponding polypeptides and dis- 
played, said polypeptides are then screened in a suitable assay, 
and positive results selected. . - 

In a still further aspect the invention relates to a method 
for 'producing a' polypeptide of interest as identified in the pre- 
ceding process, whereby a vector comprising a polynucleotide en- 
coding said polypeptide is transformed into a suitable host, said 
host is grown to express said polypeptide, and the polypeptide 
recovered and purified. 

'. Finally, in further final aspects the invention 
relates to a recombinec/shuf f led protein, which is obtainable by 
any of the methods according to the invention, and which is a 
recombined/shuf fled protein comprising the ■ sequences disclosed 

herein (vide infra) . 

In those final aspects of the invention, the ten?, 
"obtainable" denotes that said protein is preferable obtained by 
a method according to the invention. However a prior art known 
recombination/shuffling technique such as those described in WO 
95/22625 or WO 95/17413 may be used too, either alone or in 
combination with a method according to the invention, in order to 
. obtain said recombined protein. 

Accordingly, further final aspect of the invention are; 
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a recombined/shuff led protease obtainable . by. any of th 
methods according to. the invention, and comprising a recombinec 
sequence, which at least contain two diff erent * partial sequences 
from at least two different wild-type proteases; 

a recombined/shuff led - lipase obtainable ' by. any of the 
methods according to .the invention, and comprising a 'recombinec 
sequence, which at least contain two different partial wild-type 
sequences ' t 

a recombined/shuffled' Pseudomonas lipases obtainable by any 

of the methods according, to the. invention, and comprising a 
recombined sequence, which at least . contain two different partial 
sequences from * at' least two'' of the •different' wild-type. 
Pseudomonas lipases; 

a recombined/shuf f led. xylanase obtainable by any- of the 
methods according to the invention, " and comprising a recombined 
sequence, which at least contain two different partial sequences 
from at least two 'different wild-type xylanas.es;. 

■ 'a recombined/shuffled cellclase obtainable by any of the 
methods according to the invention, and comprising a ■ recombinec 
sec^ence, which- at ' least contain two different partial sequences' 
dr.em at least two different wild-type celiulases; - , 

' - a- .. ■recombined/shu: fled, amylase obtainable by any of the 
methods according to the invention, and comprising; a recombined 
sequence,, which at. least contain two different, partial .sequences 
from, at least two different wild-type amylases ' • 

"■ " a recbmbinec/shuf f led laccase obtainable by • any of th. 
methods according to the invention, '. and comprising a recombinec 
sequence, which at least ..contain. ' two different partial sequence: 
from at least two dif f erent"wi.l d-type iaccases; 

a recombined/shuffled phytase obtainable by any of th 
method:; according to the . invent ion, and comprising a ,recombine 
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sequence, which at least contain two different partial sequences 
from at least two different wild-type phytases. 



DEFINITIONS 

Prior to discussing this invention in further detail, the 
following terms will first be defined. 

"Shuffling": The term "shuffling" means recombination of 
nucleotide sequence . fragment (s) between two or more homologous 
polynucleotides resulting in output polynucleotides (i.e. polynu- 
cleotides having been subjected to a shuffling cycle) having, a 
of nucleotide .fragments exchanged,' in -comparison to the 
polynucleotides (i.e. starting point homologous polynucleo- 



numbe: 
input 
tides! 



Homology of DMA sequences or polynucleotides" In the pres- 
ent context the degree of DMA sequence homology is determined:, as' 
the degree of identity between two sequences indicating a deriva- 
tion of the first' sequence" from the second. The homology may 
suitably be determined by means of computer programs known in the 
art, such, as GA? provided iri the GCG program package (Program 
Manual for the Wisconsin Package, Version 8, August 1994, Genet- 
ics Computer Group, ,'575 Science Drive, Madison, Wisconsin, USA, 

527i:)-(Needierr.an-, S.B.'arri Wunscrv, C D-. , 1- {.19.7.0 ):,... Journal ..of ...Mo-. 

lecular Biology, 48, 443-453). 

. "Homologous": The term "homologous" means that one single- 
stranded nucleic acid sequence may hybridize to a complementary 
single-stranded nucleic acid sequence. , The degree of hybridiza- 
tion may depend on' a number , of factors' including the amount of 
identity between the sequences and the hybridization conditions 
such as temperature and salt concentration as discussed " later 
(vide infra) . - . 
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Using the computer program GAP (vide supra) with the fol- 
lowing settings for DNA sequence comparison: GAP creation penally, 
of 5.0 and GAP extension penalty of 0.3, it is in the preser.t 
context believed that two DNA sequences will be able to hybridize 
5 (using low stringency hybridization conditions as defined below) 
if they- mutually exhibit -a degree of identity preferably of az 
least 70%, more preferably at least 80%, . and even more preferably 
at least 85%. 

••heterologous".:. If two or more DMA sequences mutually ex- 
10 hibit a degree of identity which is less than above specified, 
they are in the present context said to be "heterologous". 

"Hybridization:" Suitable experimental t conditions fcr 
determining if two or more DNA sequences of interest do hybridize 
or not is herein defined as hybridization at low stringency as 
15 ; described in detail belov;. . ( 

Molecules to which th.e .oligonucleotide probe hybridizes 
under these conditions are detected using a x-ray- film or a phos- 
phoimager. 

"primer": The term "primer" " used herein especially in cc:- 
20 nection with a PCR' reaction is'.,an oligonucleotide (especially a 
" PCR-primer" ) defined -and constructed according to general stan- 
dard, specifications known in the art . ("PCR A practical approach" 

. IRL Press, (1991) ) . • 

Vh primer directed to a sequence:" The 1 term "a* primer ci- 
25 rected to a sequence" means that the primer (preferably to he 
■ used in a PCR .reaction) is constructed ..to exhibit at least 80V 
' degree. of sequence identity to the sequence fragment of interest/ 
more preferably -at least 90% degree of sequence identity to the 
sequence fragment of interest, which said primer consequently is 
30 "directed to". The primer is designed to specifically anneal az 
the sequence fragment or region it is directed towards at a given 



WO 98/41653 n PCT/DK58/00104 

temperature. Especially identity at the 3' end of the primer is 
essential. 

"Flanking" The term "flanking" used herein in connection 
with DNA sequences comprised in a PCR-fragment means the outer- 
most partial sequences of the PCR-fragment, both in the 5' and 3* 

ends of the PCR fragment. 

"Polypeptide" Polymers of amino acids sometimes referred 
to as proteins. The ' sequence of amino acids determines the 
folded conformation that the polypeptide assumes, and this ir. 
turn determines biological properties and activity. Some 
polypeptides consist of a single polypeptide chain (monomeric) , 
whereas other comprise several associated polypeptides 
(multimeric) . All 'enzymes and antibodies are polypeptides. 

"Enzyme" A protein • capable of catalysing chemical reac- 
tions. Specific types of enzymes to be mentioned are such , as 
amylases, proteases, carbohydrases, lipases, cellulases, o.xi- 
doreductases, esterases, etc; Of specific interest in relation 
to the present invention are enzymes used in detergents, such as 
Droteases, lipases, cellulases, amylases, etc. 



DETAILED DESCRIPTION OF THE INVENTION 

.. .. T-he -present ..invention, relates in its first aspect to a 
method for the construction of a library of recombined homologous 
polynucleotides from a. number of different starting single or 
double stranded parental DNA templates and primers by induced 
template shifts during an in vitro polynucleotide synthesis using 
a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing a parental double stranded DMA template to 
Droduce single stranded templates, 
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b) annealing said primers to the single -stranded DNA te 
plates, 

■ c) extending said primers by initiating synthesis by use 
'said' polymerase, ' 

d) - cause' arrest, of ,the synthesis,, and 

e) denaturing the r double strand to separate the extend 
primers from the templates, 

B. . a template . shift is induced by 

- -a) isolating the .newly- synthesized single stranded extend: 
primers from ■ the templates . and repeating steps A. b) i 
A.e) using said .extended primers produced in- (A) as.'bc: 
' primers and templates, or . 
b) repeating steps A. b),- to -A . e ) , 

C. the- above process is" terminated after an appropriate numbe 
of cycles of process steps A. and 3. a), A. and 3.b), or ccz 
binations thereof, and 

D. optionally the produced polynucleotides' are. amplified .-in 
standard PCR reaction with specific primers to selective! 
amplify homologous polynucleotides of interest. 

Accordinc ' to the invention ' the polymerase may be a DNA or 
RNA polymerase, specific 1 polyme rases, to be mentioned are such £ 
DMA polymerases* .like ".T4- polymerase, T7 polymerase , E. coli.D! 
polymerase. I or the Klenow fragment of DNA polymerase I, or the: 
mostable polymerases such as T-aq, Ampiitaq, Vent, Pwo, 

. : * -* One of the advantages- of the invention is that it makes . i 
possible to control the length of the extension of, the primers i 
the reaction in a convenient manner. • 

This can be accomplished by various means such as choice 
polymerase, the physical and chemical conditions during the a 
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tion of the polymerase, e.g. pH, temperature, buffer, salt con- 
centration, and addition of various chemicals. 

It is known that various polymerases carry out the DNA syn- 
thesis at different rates (nucleotides incorporated pry second) . 
5 For example has the Klenow fragment of DNA polymerase I a limited 
extension rate compared to e.g. the Taq polymerase (Sambrook et 
al. 1989). 

Polymerases also display differences in processivity, which 
is the average number of nucleotides incorporated before the po- 
10- lymerase dissociates from the template/extended primer; again the 
Klenow polymerase is an example of a polymerase with limited 

processivity. ' . .. 

The choice of polymerase is therefore an important means in 
controlling the average extension of the primers. 
15 These conditions may also exert an influence on the fidel- 

ity of the polymerase (the race by which- point mutations are in- 
troduced; HIV reverse transcriptase is an example of a polymerase 
of low fidelity), a parameter useful in combining shuffling and 
mutagenesis . 

20 . In specific embodiments various modifications can ' be mace 

in the process of the invention. For example it is advantageous 
to apply a defective polymerase either an error-prone polymerase 
.to .introduce mutations, .in comparison to the templates, or a po- 
lymerase that will discontinue the polynucleotide synthesis pre- 

25 maturely to effect the arrest of the reaction. Such a defective 
polymerases that could be mentioned is a Klenow polymerase having 

low processivity. 

In another embodiment of the invention polymerase will be 
added after each cycle, if the polymerase used is not thermo- 
. 30 stable . 
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According to the invention the starting single or double 
stranded parental templates may be different in tha.t they contain 
.different' point mutations in the- same native polynucleotide 
(gene), or they can be homologous polynucleotides (genes) isz- 
5 lated from nature, which may be amplified by PCR, or they can be 
combinations thereof. The templates used in the process of the 
inventipn are hereby homologous showing an identity at the DN'A 
•level of e.g. more than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%,. 
-55%, or even more than 50% identity. - 

10 It may be advantageous to use pre-selected templates ccz- 

" prising mutations with improved properties of. interest. The pres- 
ent ' recombination method of the invention will then recombir.e 
said improved mutations for subsequent screening for even further 
improvements in the properties of interest. 

15 ' : " said pre-selected templates with improved properties of in- 

terest may have been identified by standard procedures in the arc 
comprising e.g. i) error-prone PCR of templates of interest fol- 
lowed by ii) screening/selection for templates with improved 
'characteristic' of interest. The mutagenesis frequency (low, or 

20 ' high mutagenesis frequency) of the error-prone PCR step is pref- 
er ably adjusted in relation to the subsequent screening capacity, 
; e _ j_f t he -screening caoacit y is limited' the error-prone ( ?CR 
freouency is preferably low (i.e. one to two mutations in each 
template) (see WO 92/18645 for further details). 

25 The arrest of the polymerase reaction in step A. d) may as 1 

indicated above be obtained in different ways, such as by raising 
the temperature, or adding specific reagents as described in WO 

95/17413. , 

When raising the temperature for this purpose, it is pre- 

30 f erred to use temperatures between 90°C and 99°C. 



WO 98/41653 15 PCT/DK9S/00104 

When using chemical agents DMSO is a possibility. Appropri- 
ate procedures are mentioned in e.g. WO 95/17413. 

The process of the invention uses annealing of the primers 
to the templates in step l.b. In this context the annealing may 
5 be random or specific, meaning either anywhere on the polynucleo- 
tide or at a specific position depending on the nature of the 
primer. 

Also, the primers to be used, may be completely random prim- 
ers (NNNNNNNNNNN) (N meaning a mixture of the four bases (A, T, 
10 G, C) is used at a particular position in the primer during syn- 
thesis) , semi-random primers, or specific primers. 

If the extended primers produced are to be separated from 
the -primers .during the process it is convenient to use labeled 
texolates in order to provide a simple means for separation, a 
15 preferred marker is biotin or digoxigenin. 

•According to the invention the number of cycles necessary 
will be less than 500, in most cases less than 200, and normally 
less than 100 cycles. 

In an' embodiment of the invention the above in vitro shur- 
20 flinc is combined with a subsequent in vivo shuffling by methods 
such as those described in WO "97/07205. 

In its second aspect' the invention relates to a method or 
identifying polvpeotides exhibiting improved properties in con- ■ 
parison to naturally occurring polypeptides of the same bioactiv- 
25 ity, whereby a library of recombined homologous polynucleotides 
produced by the above process are cloned into an appropriate vec- 
tor, said vector is then transformed into a suitable host system, 
to be expressed into the corresponding polypeptides, said 
polypeptides are then screened in a suitable assay, and positive 
30 polypeptides selected.. 
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In an embodiment it is contemplated that the polypeptides 
of interest encoded by the shuffled • library are expressed as a 
; --suitable fusion protein (e.g. as a hybrid with gill of bac- 
teriphage M13/fd) in order to display said recombined polypeptide 
5 on the surface of phage or bacteria. 

In a third aspect- the invention relates . to a method fcr 
producing a polypeptide ' of interest as . identified in the preced- 
ing process, ' whereby." a vector comprising a polynucleotide encod- 
ing ,sa'id polypeptide is transformed into a suitable * host , sail 
10" host is grown to express said polypeptide,; and the polypeptide 
recovered and; purified . 

The use of -partial random (semi-random) or completely r.ar.- 
don primers, (mixtures of bases in a .selected number or all posi- 
tions in the primer); as initiation point for the ' DMA synthesis' 
15 provide certain novel possibilities for, the 'combined use of shuf- 
fling and random mutagenesis . /; /, ' ■ ; 

It is often a'ssociated with" difficulties to obtain an ir. 
vitro recombination of polynucleotides- that .display relatively 
. . ■ 'limited homology.' By the use of , an embodiment . ' of the inventicr; 
20 even verv .diverse .polynucleotides can be forced to recombine. 

According to -that embodiment at • least two templates (or two- 
pools of diverse ■ templates) * are , -applied. -The', novel synthesis of. 
the one polynucleotide can then be based on only one strand (i.e. 
■ either the sense or the anti-sense strand), , and the synthesis c: 
25. the other polynucleotide is based the opposite strand. . 

This can- • be accomplished by isolating ' the complementary 
strands from the two templates, .e.g. by having these strands . la- 
beled by biotin. Synthesis of- DNA is initiated' by annealing ei- 
ther specific, partly random or completely random primers 'to 
30- these templates and adding a suitable, polymerase. This can be 
performed as either separate reactions for the different ten- 
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plates or in just one reaction. Synthesis should preferably be 
performed under conditions that, favor production of relatively 
short new fragments. These fragments can subsequently be isolated 
from the templates based on the affinity label. A PCR reaction is 
carried out on these fragments and as the starting material 
originates from two different strands, the newly synthesized 
fragments must recombine in order to produce full length PCR 
products - a kind of forced recombination. 

Also for this embodiment rapid PCR with short or no exten- 
sion time can be applied advantageously in order to enhance re- 
combination, especially if pools of templates are used for the 

two strands. . • 

The length 'of the primer and the annealing temperature 
utilized in the' process determines if random primers will anneal 
and the number of mismatches between the ■ template and the primer 
that can be accommodated. By varying the primer length and the 
annealing temperature the method of the invention provides, a 
means for achieving ranco.T. mutagenesis within .a certain nucleo- 
tide window representing the length of the primer. The method- of 
the invention thereby provides substantial benefits compared to 
other random mutagenesis approaches, especially the high prob- 
ability for several base substitutions close to .each other in the 
primary sequence, e.g. the use of a completely random 20'mer 
(mixture of all four nucleotides in all 20 positions) will ac- 
cording to theory under given experimental conditions give a' cer- 
tain reasonably high probability for having several base substi- 
tutions close to each other. 

Error prone PCR (-high, mutagenesis frequency PCR) does not 
provide this possibility. 'Error prone PCR provides a very low 
probability for having more than one base substitution within one 
codon (coding for one amino acid ir. a translated polypeptide) . 
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Obviously the substitution of only, one base within a codo 
doesn't provide total random mutagenesis (at protein level) a 
only a limited set of amino acid substitutions can be obtained fa- 
one base substitution at DNA level (e.g. Methionine encoded 
ATG-codon requires three base substitution to become the TGT or 
TGC-codon encoding Cysteine). - 

-In one embodiment of the invention the process is therefore 
performed by using random or semi-random primers having a length 
of from 6 to 200 bp, preferably from 10 to 70 bp, and. better from 
15 to 40 bp. 

One of the advantages in the method of the invention is' the 
robustness. In some embodiments the- constant presence of full 
length template provides a further advantage avoiding FCR con- 
tamination problems. Furthermore it is much less laborius, with 
less hands- on, than other described methods,, thereby providing 
excellent possibilities for automation. 

PCR-primers : 

■ t The PCR primers are constructed according to the standard 
descriptions in the art.- Normally . they are 10-75 base-pairs (bp! 
long. However, for- the specific embodiment . using random cr semi- 
random .primers the length may -be substantially longer as indi- 
cated above. 

PCR-reactions : 

.If not ' otherwise mentioned the PCR-reaction performed ac- 
cording to the invention are performed according to standard pro- 
tocols known in the art. 

The term "Isolation of fragment" is intended to cover 

as broad as simply an aliquot -obtaining the PCR fragment. How- 
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ever preferably the PCR fragment is isolated to an extend which 
remove surplus of primers, nucleotides templates etc . . 

In an embodiment of the invention the DNA fragment (s) 
is (are) prepared under conditions resulting in a low, medium or 
5 high random mutagenesis frequency. 

To obtain low mutagenesis frequency the DNA sequence (s) 
(comprising the DNA fragment (s) ) may be prepared by a standard 
PCR amplification method (US 4,683,202 or Saiki et al., (1988), 
Science 239, 487 - 491) . 

10 A medium or high mutagenesis frequency may be obtained by 

performing the PCR amplification under conditions which increase 
the misincorporation of nucleotides, for. instance as described by 
Deshler, (1992), G ATA 9(4), 103-106; Leung et al., (1989), Tech- 
nique, Vol . 1, No. 1, 11-15. 

15 it is also contemplated according to the invention to com- 

bine the PCR amplification (i.e. according to this embodiment 
also DNA fragment mutation) with a mutagenesis ' step using a suit- 
able physical or chemical mutagenizing agent, e.g., one which in- 
duces transitions, transversions, inversions, scrambling, dele- 

20 tions, and/or insertions. 

Sxoressinq the recombinant protein from the recombinant shuffled 

., sequences . ...... ........ 

Expression of the recombinant protein encoded by the shuf- 
25 fled sequence in accordance with the second and third aspect or 
the present invention may be performed, by . use of standard expres- 
sion vectors and corresponding expression systems known in the 
art . 

30 Screening and selection 
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In the context of the present invention the term "positr 
polypeptide variants" means resulting polypeptide variants po. 
sessing functional properties which has been improved in compar: 
son to the polypeptides producible from the corresponding inp-_ 
5- DNA sequences. Examples, of such improved .properties can be = 
different as e.g. enhance or lowered biological activity,, ir 
creased wash performance, thermostability, oxidation stability 
substrate specificity, antibiotic resistance etc. 

Consequently, the screening method to be used for identify 
10 ing positive variants depend on which property of the polypeptic 
in question it is desired to change, and in what direction tr. 
change is desired. • 

A number ' of suitable screening or selection systems t 
screen or select for a desired biological activity are describe 
15 .in the art. Examples are: 

Straoberg et .al. (Biotechnology 13: 669-673 (.1995). ce 
scribes a screening system for subtilisin variants having Cal 
cium-independent stability; 

Bryan et al. '(Proteins- 1:326-334 (1986)) describes 
20 screening assay for proteases having a enhanced thermal stabil 
ity; and 

PCT-DK96/00322'- describes a screening assay for lipases ha~ 
• ing improved, wash performance in washing detergents. 

An embodiment of the invention comprises screening or si 
25' - lection of reco.mbinant • protein (s) , wherein the desired . biologic; 
activity is performance in dish-wash or laundry detergents.- Exa: 
ples of suitable dish-wash or laundry detergents are disclosed 
PCT-DK96/00322 and WO 95/30011. 

If, for instance, the polypeptide in question is an enzy 
30 and .the desired improved functional property is the wash perfor 
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ance, the screening may conveniently be performed by use of a 
filter assay based on the following principle: 

The recombination host cell is incubated on a suitable me- 
dium and under suitable conditions for the enzyme to be secreted, 
the medium being provided with a double filter comprising a first 
protein-binding filter and on top of that a second filter exhib- 
iting a low protein binding capability . The recombination host 
cell is located on the second filter. Subsequent to the incuba- 
tion, the first filter comprising the enzyme secreted from the 
recombination host ceil is separated * from the second filter com- 
prising said cells. The first filter is subjected to screening 
for' the desired enzymatic activity and the - corresponding micro- 
bial colonies present on the second filter are identified. 

The filter used for binding the enzymatic activity may , be 
15 any protein binding filter e.g. nylon or nitrocellulose. The top- 
filter carrying the colonies of the expression organism may be 
any filter that has no or low affinity for binding proteins e.g. 
cellulose acetate or Durapore®. The filter may be pre-treated 
with any of the conditions to be used for screening or may - be 
20 treated curing the detection of enzymatic activity. 

The enzymatic activity may be detected by a dye, fluores- 
cence, precipitation, pH indicator, IR-absorbance or a.ny other 

" ' 'known" technique -for- detect ion of- -enzymatic activity-.. 

The detecting compound may be immobilized by any immobiliz- 
es ing agent e.g. agarose, agar, gelatin, polyacrylamice, starch, 
filter paper, cloth; or any combination of immobilizing agents. 

If the improved functional property of the polypeptide* is 
not sufficiently good after one cycle of shuf-fling, the polypep- 
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30 -in an. embodiment of the invention wherein homologous 

ooivnucleotides representing* a number of mutations of the same 
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gene is used as templates at least one shuffling cycle is a back- 
crossing cycle with the initially used DNA fragment, which may be 
the wild-type DNA .fragment* This eliminates non-essential muta- 
tions. Non-essential mutations may .also be eliminated by using 
5 wild-type DNA fragments as the .initially used input DNA material. 

Also contemplated to be within the invention is polypep- 
tides having biological activity such as insulin, ACTH," glucagon, 
somatostatin, somatotropin, thymosin/ parathyroid hormone, pitu- 
. ary . hormones, somatomedin, erythropoietin, luteinizing hormone, 
10 chorionic gonadotropin, hypothalamic releasing factors, antidiu- 
retic hormones, thyroid stimulating hormone, relaxin, interferon, 
thrombopoeitin (TPO) and prolactin. _ • ■ ■ 

A requirement to. the startling parental DNA sequences, en- 
coding the polypeptide (s) , to be shuffled, is that they are az 
15 least 50%, 60%, 70%, 80%, 90%, or 95% homologous. DNA sequences 
being less homologous will have less inclination to interact and 
recombine . 

* ' It is also contemplated according to the invention to shuf- 

fle parental polynucleotides that are homologous as indicated 
20' above originating fro™ wile type organisms of different genera. 

Further, the starting parental templates to be shuffled* may 
preferably have a length of from about 50 'bp to 20 kb, preferably 
about 100 bp to 10 kb, more- preferred about 200 bp to 7 kb, espe- 
cially about 400 bp to 2 kb . 
25 , The starting parental DNA sequences may be any ' DNA se- 

. quences including wild- type DNA sequences, DNA sequences encoding 
variants or mutants, or modifications thereof, such as extended 
or elongated DNA sequences, and may also be the outcome of DNA 
sequences having been subjected to one or more cycles of shuffl- 
30 ing (i.e. output DNA sequences) according to 'the method of the 
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invention or any other method (e.g. any of the methods described 
in the prior art section) . 

When using the method of the invention the resulting recor:- 
bined homologous polynucleotides (i.e. shuffled DMA sequences), 
5 have had a number of nucleotide fragments exchanged. This results 
in replacement of at least one amino acid within the polypeptide 
variant, if. comparing it with the parent polypeptide. It is to be 
understood that also silent exchanges are contemplated (i.e. nu- 

■ cleotide exchange which does not result in changes in the amino 
10 acid sequence) . 

MATERIALS AND METHODS 

Specific Method Used in the Examples: 

A) DNA encoding different enzyme variants of the same gene or 
15 different enzymes having the same type of activity encoded*-- by 

homologous genes are nixed. The DNA is provided as either PCR 
fragments, plasmid, phage or cer.omic DMA. 

B) The resulting pool of D:. : A is mixed with DNA Polymerase', 

' dNT?, a suitable buffer and primers (being either random oii- 
20 gomers (length of 6-30 nucleotides) or specific oligomers 
(length of 6-50 nucleotides) or a combination of both types).' 

C) The ?CR mixture is out - into a PCR thermocycler (either 
- cold or hot) - in .a -suitable tube. ....... 

cl) The thermocycier is heated to a temperature of 90- 
25 100°C for a period of time ( typically 1-10 ruin) in order to dena- 

■ ture the DMA templates. 

c2) Thereafter the following procedure (cycle) is fol- 
lowed (repeated) :. The template is denatured (typically 90-100°C 
for 0-5 minutes) . Then the temperature is lowered (typically .tc 
30 a value between 10°C and 90 C C for 0-5 minutes) to allow annealing 
of the primer to the sincie stranded template. Nov/ the tempera- 
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ture Is raised again to- dena.turation temperature (90-100°C) al- 
lowing small extension of the primer to be synthesized by the 
DNA polymerase- during ramping. Alternatively a short extensic.- 
period (typically 0-30 seconds at 70-75°C) can be introduced zz 
"5 allow larger extensions of the primers to be generated. When* the 
temperature . reaches a value where denaturation takes place, the 
/extended primers and templates are again "separated. This proce- 
dure can be repeated ( typically' between ,1 to 99 cycles) . 
' .D) Having performed' the 1 desired" "number of cycles the gener- 
ic ated small DMA polymers can be purified from the oligomers use: 
as primers. One way is to isolate and clone a specif ic amplified 
band containing the gene "coding for the polypeptide of interest 
into a. suitable vector.. This can be done either on agarose gel 
(typically isolating fragments between 50 to 1000- base pairs;, 
15 \ by beads (using an affinity label, on either templates or prin- 
ers) or through columns. 

■E) Then -the purified (or the not . purified) DNA polymers c = r. 
be .assembled in a standard -PGR reaction (for instance 94°C, 5 
minutes, '.(94°C, 30 sec,^ 55°C, *30 sec; 72°C,: 2' rr.in) * '2 5., 72°C 5 

20 minutes, 4°C) . . * .... , 

. ' ' Specific • primers or "DNA colters generated by' specific, 

primers, can .be added 'in order to generate a specific DNA polymer 

containing the eerie cf interest: . This As "mentioned in point D, 

tnis DNA polymer can be purified and cloned into a vector of 'in- 

2 5 terest . . • . . . . - _ ; _ ' ' _ ■ : , ' , ..... 

EXAMPLES ' 
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Method 1 . 

The strong advantage: of the method exemplified here is the 
robustness and lack of PCR contamination problems, due to the 
constant presence of parental template. Furthermore this method 
is less labor demanding than methods, described in the prior art, 
thereby providing excellent possibilities for automation. , 

Nine different plasmids containing DNA sequences encoding 
9 different variants of the H. lanuginosa lipase gene, were 
mixed in .equimolar amounts. The variant genes contained from two 
to seven mutations scattered throughout the gene. 

The DNA sequence of the H. lanuginosa lipase gene and the 
amino acid sequence of the lipase are disclosed in EP 0 305 216 

The variants' are indicated according to standard terminol- 
ogy as described in e.g. EP 0 396 608, and WO 97/07202. 

The following 9 variant genes were shuffled: . ; 



1 ' . N94K+D96L+E99K 

2' .. SP?RRP+N94K+D96L+T23.IR+N233?.+-D234R+Q24-9R 
0- 3'. S??R?.?-+A19C+C36rt+N34rHD9SL-rQ2 4SR ' ' ■ • 

4'.-. STPRRP + N94R ' 
" 5' .. SCIRRrN94K+D96L-f£239C-rQ24SR 

6' . DI37G+D167G+E2I0V 
: 7.'. D95L + E99K-^Vi87A 
>5 8'. SP?RRP+D57G+N94K+D9SL*Q249R 

9' . N94R-S-F95L ' ' . . ' ' 

The following components where mixed , in a microtube: 

2 nl plasmid mixture ( C . 1 5 ug/ul ) , specif ic primers flar.k- 
30- ing-the gene (1 pmol/ul) , 2'ui 2.5 mM cNTP, 2. 5 m>l MgCl : , 2 ul 
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10* taq buffer (Perkin Elmer), 0.5 ul taq enzyme in a total vol- 
ume of 20 ul. 

The tube , was set in a Perkin Elmer 2400 thermocycler The 
following, PCR-program was run:(94°C, 5 minutes) 1' cycle: 
(94°C , 30 seconds, 70°C, 0 seconds) 99 cycles(72°C, 2 minutes,' 

4°C indefinite). 1 cycle ., " ... •' 

The PGR-reaction was run on a 1.5 % agarose gel. A DNA- 
band of the specific expected . size was cut out of the agarose 
gel and purified using JETsorb (from GENOMED Inc.). The purified 
?CR-product was cloned into a TA-vector (from' Invitrogen (the 
original TA cloning kit).- The ligated product was transf o'rmed 
into a standard Escherichia coli strain (DH5a). 

20 transformants where fully sequenced across the gene of 
interest . 



Result: 

■The following 20 variants were. found: 

1. D137GrD!6:7G + E210VtY213C 

2. S???.R?+D57G-rM.94K+D96LrQ249?> • 
20 3-. N94R+F95L 

A. S??R.?.?-rD137G-j-DI67G-!-E210V 

5 . N 9 4 K r D 9 ' 6 L E 9 9 K + V 1 3 7 A - T 2 5 7 1 

5, D137G+D167G+E210V 

7. N 9 4 K+ D96UE99 K+ Vi 8 7 A 
25. 8.. D57G+.M94R+F95L+Q249R ■ 
■ 9. N94K+D96L+E99K+E210V 

10. S?PRR?+A19C.+C36A+N94KrDScL 

11 ..' K34R-5-F95L 

12. D137GtD167G+E210V 
30 13." N94K+D96L+Q249R 
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14 . SPPRRP+Q15P+A19C+C36A+N94K+D96L 

15. SPPRRP+N94K+D96L+T231R+N233R+D234R+Q249R 

16. D137G+D167G+E210V 

17. SCIRR+N94K+D96L+Q249R 
5 18. N94K+D96L+E99K 

19. N94R+F95L 

20. SPPRRP+N94R+F95L+F113S+Q24 9R 

* Nearly all mutations where represented (19 of 20) indicat- 
10 ing little bias for specific templates. 
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Statistics: 


Not shuffled 




10 


Shuffled between 


at least 2 templates 


8 


Shuffled between 


at least 3 templates 


2 



The shuffled sequences can then be subcloned from the E.coli TA 
"vector into the yeast vector pJS026 as a BanHI-Xbal fragment 
15 (see WO 97/07205), and e.g. screened for new shuffled sequences 
with improved performance in .detergents (see WO 97/07205). 



EXAMPLE . 2 
Method 2: 



PCR products of 10 different lipase variant genes were 
generated as above and pooled. in equimolar. amounts . 



The following 10 mutant genes were shuffled. 

1*. D137G+D167G+E210V 

25' 2'. D96L+E99K+V187A 

3' . -N94K+D96L+E99K 
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4' . SPPRRP+D57S+N94K+D96L+Q249-R ' 

5'. D111N+F21.1A+G225P \ 

• 6' . SPPRRP+N-94K+D96L+T231R+N233R+D234R+Q249R' 

.7 • . SPPRRP+A19C+C3 6A+N94K+D96L+Q24 9R. 

5 8 ' . STPRRP+N94R . 

9' . N94R+F95L 

t 10'. SCIRR+N94K+D96L+E239C+Q249R ■ - ' 

t-h-e following mixture' was generated in a .suitable tube : 
10 1 |il PCR mixture, (Oil. '.fig)-, decanter- random .- prir.ex (3CO 

pmol), 2 jal 10* .Klenow' buffer- (Promega), "0.25 *mM dNT?, • 2.5 rJ-{ 
MgClv in a total volume of -20 jil . " , ; ' ' * . 

The mixture was"; set in a PS2400 thermocycler where the 
..following program was run: . 96°C, 5 minutes, 25°C 5 minutes, 0.5 
15 ml . Klenow. enzyme was added, 25°C'' 60/minutes, 35°C 90 minutes. 

This procedure generated a high number of small . DMA poly- 
mers originating from all parts of the' : gene 

10 jil was taken out for test on agarose gel. 
10 "pi PGR mixture (0.25 mM-' dNTP, 1 ■ |il -10* Taq buffer 
20 "(Per kin Elmer), 2.5 rr.X : : :;C1 2/ 0 . 5 \xh Taq enzyr:e) was added t.o 
• .. -the -'10 |il in the tube in the thermocycler. Then the following 
'"■■'standard PCR-program was run-. (94°C,. 5 minutes) ■ 1 cycle, (94°C 53 
' < seconds, 45°C,. 30 ' seconds, -72°C 30 seconds) 25 cycles, 72°C *«7. 

' • ' minutes, 4°C indefinite , . ' 
25* ■ ■■ The PCR products were run on a . 1 . 5% .agarose gel. A clear 

unbiased smear was seen. DMA between 400 and 800 bp was isolated 
from the gel . 

Half of the purified.?::-', product was mixed in . a tube with 
two 'specific primers. (40 cmcl) flanking the gene . of interest, 
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0-.2 5 mM dNTP, 2 \xl 10* Taq buffer, 2.5 rruM* MgCl 2 . Then the fol- 
lowing standard PCR-program was run: (94°C , 5 minutes) 1 cycle, 
(94°C 30 seconds, 50°C, 30 seconds, 72°C . 30 seconds) 25 cycles, 
72°C 7 minutes, 4°C indefinite. 
5 The PCR product was run on a 1.5% agarose gel. A specific 

though weak band of the expected size was isolated. Additional 
PCR was run using specific primers (as mentioned above) in order 
to amplify the PCR-product before cloning. 

The PCR-product and the desired vector were cut with the 

10 appropriate restriction enzymes (3am>iI/XhoI ) . The vector and the 
PCR product were run on a 1.5% agarose^ gel, and purified fro- 
the gel. ^ 

The cut PCR-prpcuct and the cut vector were mixed in a li- 
gase buffer with T4 DNA ligase (Promega). After overnight 1-iga- 

15 . tion at 1 6°C the mixture was transformed into E. coli strain 
DH5a. 

19 clones we.re fully sequenced across the gene. 
Result : 

20 The follov/ing 19 variants were found: ; 

ST?RR? + N9 4?.TN2 3 3R-i-D2 3 4RfQ2 4 9R 
SPPRRP+N2 33R+D2 3 4R+Q2 4 9R 
D96L+D167G+Q249R 
N 9 4 R 
D167G 

SPPRR? + A19C + A2ST-i-N94KTD9 5LrDlllM + £2 39C 
SPPRR?+A19C+C3 6A+N9 4KtD5SL+Q2 4 9R 
N94KrD96L - 
N2 5T + D57G + N94R+E9 9K^D167G-^T2 31R^N2 3 3R^D2 3 4R+Q2 4 9R 



1 . 

2 . 
3. 

-25 

5 . 
6. 

7 . 

8 . 

30 9. 



WO 98/41653 



30 



PCT/DK98/00104 



10. N94R+Q249R 

11. D167G 

12. D167G ' 

13. "-T32I+N94R+F95L+D167G+Q24 9R 

5 14 . E87K+N94K+D96L * . ' 

' 15. N94R+F95L+Q249R 

16. N94K+D96L+D11IN 

17. STPRRP+S17T 

1-8. N94K+D96L+V187A 

10 19. SPPRRP+D57G+N94K+D96L+D111N+L151S 

All template variants were represented indicating littl: 
bias for specif ic 'templates . 

There were no apparent hot spots ■ with regard to-mutatio: 
15 exchange and it seems to be evenly distributed * along the gene 
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Statistics : 



Not shuffled 










1 




Shuffled between 


a - t 


least 


2 


templates 


10 ■ 




Shu f fled between 


at 


least 


3 


templates 


6 




Shuffled between 


at 


least 




templates 


1 




Shuffled between 


at 


least 


5 


templates 


0 




. Shuffled between 


a t 


least 


6 


templates 


1 




The .shuffled 


se 


que no 


es 


can then 


be subcloned' 


from the 


'.coli TA vector i 


nto 


the 


ye 


ast vector 


pJS026 as a 


BamHI-Xbal 



fragment (see WO 97/07205), and 'e.g.,' screened for new shuffled 
sequences with improved performance in detergents (see WO 
97/07205) . 
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Example 3 : 

Amylase variant shuffling: 

5 In Example 1, it was shown how a number of multiple vari- 

ants of H. lanuginosa lipase were shuffled. In a similar manner, 
variants of Bacillus a-amylases can be shuffled. 

Earlier patent applications have identified variants of 
various a-amylases from Bacillus species improved for particular 
10 properties, e.g. thermostability, stability under Calcium- . 
depleted conditions, improved wash-performance etc. (see 
WO95/10S03, W09.6/23874, W096/23873, and PCT/DK97/00197 ) . 

Variants of B. licheniformis a-amylase amyL can be shuf- 
fled as follows. The variants are all located in- the B. svbtilis 
15 expression vector pDN1528 described in WO95/10603. ' r 

■ The experiment is carried out under the exact same condi- 
tions as Example 1 except that the flanking 27mer primers used 
to initiate DMA synthesis were different. 

The ?CR amplified band of approximately 1500 bp is puri- 
20 f ied from an agarose gel and cloned as described in Exarr.plel . 

Alternatively restriction sites located within. the amyL gene car. 
be utilized to clone the library of shuffled genes into either 
' Bacillus plasmid pDN1528"" or an Z. coli vector" containing -the- - 
wild type amyL gene, e.g. pJeENl described in W096/23874 

25- 
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Shuffling. of two genes encoding homologous a-amylases: 
' amyL and the amylase identified by SEQ.ID no2 (amino, acid) and 
SEQ.ID no. 5 (DMA) described in W096/23873. 

5 The forward strand (identical to the mRNA). of amyL can be ampli- 
' fled ,in'a PGR using standard conditions. 
■ The. forward strand is separated from the reverse strand 

based on its affinity to streptavidin coated magnetic beads and 
... . .denaturation of' the ..two -strands with NaQH. ■ 
10'* Similarly the reverse strand (complementary to mRNA) of the amy- 
lase encoded by SEQ^ID no5 (W096/23873) can .be • amplified and 
. isolated' . .-. 

Two primer strands are used as templates in a .PCR: (94 °C 5, 
minutes) + : 99- cycles of (94°C, 30 seconds; 60°C, 0 seconds) +• 
15 (72°C, ' 5. minutes)' using, random' "primers of various, lengths, Taq 
■ polymerase and standard buffer conditions as described in Exam- 
'. pie 1 . ■" - . , ; , ■ 

.'. The resulting approximately 1500 bp product is cloned ei- 
ther»'as''TA cloning as described in' Example one (for verification 
20 of the sequence of- the .resulting- -clone) or-into Bacillus vector, 
pTVai.l.O utilizing Sfil and ?stl restriction sites. 

1 —The. original template can be removed from the PCR at 'any ■ 
steD (e.g. after' 5, 10 or 20 cycles) based on the biotin tag). 
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PATENT CLAIMS 

1. a method for the construction of a library of recombined 
homologous polynucleotides from a number of different starting 
single or double stranded parental DNA templates and primers' by 
induced template shifts during an in vitro polynucleotide syn- 
thesis using a polymerase, whereby 

A. extended primers or polynucleotides are synthesized by 

a) denaturing a parental double stranded DNA template to 
produce single stranded templates, 

b) ■ annealing said primers to the single stranded DNA 

templates, 

c) extending said primers by * initiating synthesis by use 
of said polymerase, 

d) cause arrest of the synthesis, and • 

e) * denaturing the double strand to separate the extended 

primers from the templates, 
3. a template shift is induced by 

a)- isolating the newly synthesized single stranded ex- 
tended primers f rem the templates and repeating steps 
A.b) to A.e) usir.c said extended primers produced in. 
(A) as both primers and templates, or. 
.b) .. .repeating steps A_._b) to A.e) , 

C. the above process is terminated after an appropriate num- 
ber of cycles of process steps A. and B.a), A. and 3.b), 
or combinations thereof; and 

D. optionally the produced polynucleotides are amplified in a 
standard PCR reaction with specific primers to selectively 
amplifv homologous polynucleotides of interest., 

2 The method of claim 1, wherein said polymerase is a ther- 

mostable DNA polymerase, such as Taq, Amplitaq, Vent, Pwo. 
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3. The method of claim 1, wherein said polymerase is a DNA 
polymerase/ such as T4 polymerase, T7 polymerase , E. coli DNA 
polymerase I or the Klenow fragment of DNA polymerase I, and 

5 said polymerase is added to the reaction mixture after each cy- 
cle. 

4. The method of claim 1, wherein said polymerase is a RNA 
polymerase./ ' • 

10 

5. The method of any of the claims Ito 4, wherein said po- 
lymerase is an error-prone polymerase, such as a reverse tran- 
scriptases like HIV' reverse transcriptase. 

15 6. The method of any of the claims 1 to 5, wherein said 
' starting sinigle or double stranded parental templates are dif- 
ferent in that they contain different point mutations in the 
. -same native polynucleotide (gene), or are . homologous polynu- 
cleotides (genes) isolated' 'from nature. * 

20 . ' ^ 

7. The 'method of clairr. 6, wherein 'said starting single or 
double stranded' parental templates exhibit an identity at 'the. 
DNA level of e.g. more than 95%, 90% , 85%', " 80% , 7 5%,, 70%, 65%/ 
60%, 55%, or even more than 50% identity.. • . 

8. " The .method 'of any of the "claims- 1 to 7, wherein the ar- 
rest' of the polymerase 'reaction in step A.d) is 'achieved' by" 
raising the temperature. . . ■ . 
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9. .The method of claim 8, wherein the temperature is . raised 
to a value between 90°C and 99°C, preferably 94°C. 
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10. The method of any of the claims 1 to 7, wherein the ar- 
rest of the polymerase reaction in step l.d is achieved by add- 
ing specific reagents, such as DMSO. 

11. The method of any of the claims 1 to 10,' wherein said 
primers comprise a population of completely random primers. 

12. The method of any of the claims 1 to 10, wherein said 
primers comprise a population of semi -random primers. 

13. : The method of any of the . claims 1 to 10, wherein said 
primers comprise a population of specific primers. 

14. The method of any of the claims 1 to 13, wherein said 
primers are labeled, preferably .with biotin or digoxigenin. ^ 

15. The method of any of the claims 11, 12 or 14, wherein 
said random or semi-random primers have a length of from 6 to 
500 b?,' preferably from 15 to 300 bp, and better from 25 to 150 

'bp. 

- 15. -. The- method of. .any of the claims 13 or 14 , wherein said^ 
primers are 6 to 7 5 bp long. 

17. The method of any of the claims 1 to 16, wherein said 
starting parental DN.V templates are at least 50%, 60%, 70%, 
80%, 90%, or 95% homologous. 

18. ..The method of claim 17, wherein said starting parental 
DMA templates originate from wild type organisms of different 
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19. • The method of any of the claims 17 or 18, wherein said 
starting "parental templates have a length of from about , 50 bp 
to 20 kb, preferably about 100 bp to 10 kb, more preferred 
5 about 200 bp to 7 kb, especially about 400 bp to 2 kb 

. 20. The method- of any of - the claims 1 to- 19, wherein said 

"starting parental templates are linear DMA fragments generated 

, by a PGR reaction . , 

10 , ; ■ . : '"' 

21. The method of any of .the .claims 1 to 20/ wherein said 
starting parental templates are. cloned ' into suitable vectors, 

. such as a plasmid.- . ' . . . 

15 22.. ■ The' method of any of the' claims 1 to ,21, wherein- said 
starting parental templates represent a polynucleotide encoding 
an enzyme, such as a. 'carbonyl hydrolase, carbohydrase, or es- 
terase, such as. a protease, lipase, amylase, cellulase, oxi- 
dase,, ox.ido reductase, etc. .' . • . , 

2 0 - ' " " ' - * " . ••' ■ ' 

23. The method of - any. of . the claims . 1 to 21, 1 wherein ( said 
starting parental templates' represent 'a . polynucleotide' encoding 
a. polypeptide haying biological activity, such as insulin, 
ACTH, glucagon, 'somatostatin, somatotropin, thymosin, parathy- 

25 \ roid hormone,, pituary ■ hormones, somatomedin, erythropoietin, 

/ "luteinizing- hormone, * chorionic gonadotropin,, .hypothalamic 're- 
leasing ' factors/ antidiuretic hormones-, , thyroid stimulating 
hormone, relaxin/ interferon, thrombopoeitin (TPCV) and prolac- 
tin. 

30 

24. The method of any of the claims 1 to 21, wherein said 
starting parental templates, represent a polynucleotide contain- 
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irig a biological function such as transcription initiation 
(promoter) or termination, translatiorial initiation, operator 
* sites related to expression of genes 

5 25; The method of any of the claims 1 to 24, wherein, said 
number of cycles are less than 500, 200; or less than 100 cy- 
• cles . 

26. A method of identifying a polypeptide of interest exhib- 
10 iting improved properties in comparison to naturally occurring 
or other known polypeptides of the same activity, whereby, a li- 
brary of recombined homologous .polynucleotides produced by a 
process according to any. of the claims 1 to 25 are cloned into 
an appropriate vector, said- vector is transformed into a suit- 
15 able host system, to be expressed into the corresponding, 
polypeptides and displayed, and said polypeptide , is screened in 
a suitable assay, and positive polypeptides selected. >' 

27.. A method for producing a polypeptide of interest as iden- 
20 tified according to claim 26, whereby a vector' comprising a 
- polynucleotide encoding said identified polypeptide is' trans- 
formed into a suitable host, said host is grown to express said. 
... polypeptide, ...and „the _polype.pt ide...,recp;v'ered_a_nd .purified 
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